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ABSTRACT 

The Protein Data Bank in Europe (PDBe; pdbe.org) is 
a partner in the Worldwide PDB organization 
(wwPDB; wwpdb.org) and as such actively involved 
in managing the single global archive of biomacro- 
molecular structure data, the PDB. In addition, PDBe 
develops tools, services and resources to make 
structure-related data more accessible to the bio- 
medical community. Here we describe recently 
developed, extended or improved services, includ- 
ing an animated structure-presentation widget 
(PDBportfolio), a widget to graphically display the 
coverage of any UniProt sequence in the PDB 
(UniPDB), chemistry- and taxonomy-based 
PDB-archive browsers (PDBeXplore), and a tool for 
interactive visualization of NMR structures, corres- 
ponding experimental data as well as validation and 
analysis results (Vivaldi). 

INTRODUCTION 

Since the early 1970s, the Protein Data Bank (PDB) has 
been the single global archive in which 3D structure infor- 
mation about biomacromolecules (including complexes) 
is archived (1,2). Since 2003, the PDB archive has been 
managed by an international organization called the 
Worldwide PDB (wwPDB; wwpdb.org) (3,4). It consists 
of the Research Collaboratory for Structural 
Bioinformatics (RCSB) (5) and the BioMagResBank 
(BMRB) (6) in the USA, the Protein Data Bank Japan 
(PDBj) (7) and the Protein Data Bank in Europe (PDBe; 
pdbe.org) (8,9). The four wwPDB partners accept and 
process depositions of new structures and supporting ex- 
perimental data and jointly curate, remediate and 



distribute the PDB archive. They also work together 
(often in consultation with the community) to define de- 
position and annotation policies and procedures, file 
formats, descriptions of chemical compounds and 
polymer components, and validation standards for struc- 
tural data. In addition, each of the partners offers inde- 
pendent services to users of structural information. PDBe 
aims to develop tools, services and resources that help 
make the wealth of data about biomacromolecular struc- 
ture and function more easily accessible to the wider bio- 
medical community (10). Many of these tools have been 
described recently (8,9). In this article, we briefly describe 
several recently developed or enhanced services provided 
by PDBe. 

PDBportfolio: HIGHLIGHTING SALIENT FEATURES 
OF A PDB ENTRY 

In order to convey salient features and annotation 
in the context of 3D structure, PDBe has de- 
veloped an animated widget called PDBportfolio 
(pdbe.org/portfolio), Figure 1. It presents a slide show 
of images that convey important information and 
value-added annotation about a selected PDB entry or 
entries. The legend of each image contains more details 
as well as links to relevant web pages at PDBe or external 
resources. The slide show covers: 

• Quaternary structure — the largest assembly identified 
by the depositors or PISA (11). 

• Deposited model — a cartoon and a surface representa- 
tion are shown separately. The cartoon is coloured 
by polymer chain and shown with non-polymeric 
entities as space-filling (CPK) models. The surface is 
coloured by atomic properties using some simple rules 
as defined in PyMol (pymol.org) (12). In the case of 
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Figure 1, The PDBportfolio widget provides images and annotation about important aspects of one or more PDB entries, including the quaternary 
structure, domains and bound ligands. Refer to the text and pdbe.org/portfolio for more information, or see pdbe.org/lfcc for a live animated 
example. 



protein-DNA/RNA complexes, the image shows only 
the protein surface for clarity. 

Domain structure — separate images show SCOP (13), 
CATH (14) and Pfam (15) domains as annotated by 
the SIFTS resource (16). Each domain is highlighted 
using a coloured cartoon and its boundaries are 
further highlighted by a semi-transparent surface of 
the same colour. Different surface styles are used to dis- 
tinguish multiple occurrences of the same domain type. 



Ligands — the binding environment of at most three 
bound chemical compounds is shown. Compounds 
that are most likely crystallization agents (such as 
glycerol) are ignored. 

Experiment-dependent information — for X-ray crystal 
structures, temperature-factor information is shown on 
the structure and red surface patches indicate where 
crystal contacts occur. For NMR entries, the entire 
ensemble of models is shown. For 3DEM entries, the 
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Figure 2. The UniPDB widget provides a graphical overview of the sequence coverage of any UniProt entry in the PDB. This example (pdbe.org/ 
unipdb?uniprot = FAS_HUMAN) shows the domain architecture of human fatty acid synthase (FAS), a multi-enzyme assembly line of fatty acids. 
Pfam identifies eight domains in human FAS, five of which have structural coverage in the PDB. Salient features of the relevant PDB entries are 
depicted using PDBlogos. Refer to the text and pdbe.org/unipdb for more information. 



EMDB (17,18) map is shown if available, with the 
PDB entry fitted into it. 

PDBportfolio is used to display information about 
every PDB entry on its PDBe Atlas page (e.g. pdbe.org/ 
lcbs). The widget can also be used freely in external 
web pages to convey key information about one or more 
PDB entries. The control buttons on the interface allow 
users to manipulate the slide show. They may also 
download an archive with all the PDBportfolio images 
of an entry (as well as the PyMol scripts used to 
generate them), or view all images and legends in one 
web page. 

UniPDB: UniProt-PDB SEQUENCE COVERAGE 

UniPDB is a widget that provides a graphical display of 
the sequence coverage in the PDB of any UniProt (19) 
entry (pdbe.org/unipdb), Figure 2. Proteins encountered 
in PDB entries may contain partial sequences (e.g. one or 
more stably folded domains), chimeric sequences, fusions 
with other proteins and all manner of modifications to the 
wild-type sequence. Some modifications occur naturally, 
whereas others are introduced by experimenters to facili- 
tate purification or crystallization, or to allow investiga- 
tion of the effect of a mutation on the behaviour of the 
protein (such as catalytic activity or ligand-binding speci- 
ficity). In addition, the structure of a protein may 
have been determined many times, e.g. in different 
laboratories, using different techniques, under different 
conditions, or in complex with different ligands or other 
biomacromolecules. UniPDB provides an intuitive, graph- 
ical overview of the structural information available for a 
particular UniProt entry, based on mapping data provided 
by the SIFTS resource (16). It also provides annotation 
of the Pfam (15) domains occurring in the sequence. 



For every PDB entry that contains (a part of) the select 
UniProt sequence, PDBlogos (9) instantly reveal if it is an 
X-ray, NMR or EM structure, if it contains DNA or 
ligands, etc. From the UniPDB widget, a FASTA search 
of the PDB for related sequences can be launched; the 
results of this search are presented in the PDBeXplore 
browser (9). 

PDBeXpress: PDB ANALYSIS TOOLS 

PDBeXpress (pdbe.org/express) is an umbrella name for a 
collection of easy-to-use and powerful PDB analysis tools. 
Most of these use the PDBeMotif (20) web service as the 
underlying tool, while others access the PDBe search 
database directly. At present, there are two PDBeXpress 
modules in production and several others are under devel- 
opment. The first two modules can be used to answer the 
following common questions: 

• What residues are found in the binding sites of a given 
compound? Using PDBeMotif, PDBeXpress retrieves 
the residues with which a ligand interacts as 
observed in current PDB entries. A ligand can be 
selected by providing its name or three-character 
PDB identifier. The results are presented as a graph 
that shows the relative occurrence of the amino acids 
in the binding sites of the compound (extracted 
from the PDBeMotif database), Figure 3. There are 
options to view the PDB entries in which these inter- 
actions occur, or to perform further analyses using 
PDBeMotif. The graphs and the data can be 
downloaded. 

• What compounds are known to bind a given set of 
residues? PDBeMotif is used to retrieve all ligands 
observed in the PDB to interact with a given set of 
amino acids and the results are again shown in an 
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Figure 3. PDBeXpress (pdbe.org/express) can be used to find out which residues are observed to interact with a certain ligand in entries in the PDB. 
The graph shown here is for ATP and shows a preponderance of Gly, Lys, Ser and Thr residues, as expected in the Walker ATP-binding motif (21). 
The graph also shows other commonly observed residues in ATP binding sites such as Arg, Asp and Glu. 



interactive graph. This tool can be used to generate 
hypotheses about the type of compounds that could 
conceivably bind in a pocket or cavity, given the 
nature of the residues that line it. 



EM-RELATED RESOURCES 

The Electron Microscopy Data Bank (EMDB) was estab- 
lished at the EBI in 2002 (17) and is now managed and 
developed in collaboration with the RCSB and Baylor 
College of Medicine (18). In addition to the joint 
EMDB portal (EMDataBank.org), there are some EM- 
related resources at PDBe as well, which have recently 
been reorganized, restyled and expanded (pdbe.org/ 
emdb). The data held in EMDB constitute a treasure 
trove of information on the state of, and trends in, the 
3DEM field. Examples of interesting information that 
can be mined from the archive include trends in the reso- 
lution of EM studies and the size of the structures that 
have been deposited. Specialist users may also be inter- 
ested in comparisons of the relative popularity of micro- 
scopes and software packages. EMstats (pdbe.org/ 
emstats) is a new service that mines the database for 
such information and presents the results as interactive 
charts that are generated dynamically and represent the 
current state of the information in the database. The 
graphical elements of the charts (pie diagrams, histo- 
grams, etc.) are active, which means that clicking on 
them results in a query to the database, the results of 
which are shown below the chart, Figure 4. 

NMR-RELATED RESOURCES 

PDBe provides a variety of NMR-related data to the sci- 
entific community (pdbe.org/nmr) (8,9). A statistics page 



listing the number of NMR entries in the PDB for which 
additional information is held at PDBe or elsewhere is 
now available and updated weekly. In the past year, we 
have added access to the logRECOORD database (22) 
that contains recalculated structures (using a log-normal 
potential for interpreting NOEs) for more than 300 NMR 
entries in the PDB. 

Vivaldi (Hendrickx et al., manuscript in preparation) is 
an interactive graphical web tool aimed at both expert and 
non-expert users of NMR structural data (pdbe.org/ 
vivaldi), Figure 5. It allows visualization of NMR ensem- 
bles and individual structures together with associated ex- 
perimental data (such as chemical shifts, distance 
restraints and residual dipolar couplings, RDCs) and 
derived validation-related information. The latter is 
partly generated using the PDBe services OLDERADO 
(23) and VASCO (24) and partly extracted from the 
external NRG-CING database (nmr.cmbi.ru.nl/ 
NRG-CING). Vivaldi uses the OpenAstexViewer (25) to 
present 3D displays of one or more models from the 
ensemble, Figure 5A. By default, the most representative 
model as identified by OLDERADO is shown. A separate 
interactive ID graph displays any of a variety of valid- 
ation scores or counts, such as the number of distance- 
restraint violations per residue, deviations of the chemical 
shift values from statistical averages as reported by 
VASCO, Figure 5B, or the fit between calculated and de- 
posited RDCs. The ID and 3D displays are coupled, 
which means that analysis and validation results can be 
inspected simultaneously as a function of residue number 
and in the context of the 3D model. Both the 3D structure 
views and the ID graphs can be saved as high-resolution 
images for use in publications or presentations. 
In addition, an information panel offers explanations of 
the different views in plain English text as well as detailed 
residue-specific information. Vivaldi also has a user- 
friendly 'wizard' option to help users obtain a particular 
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Figure 4. Molecular weight (MW) distribution and MW trends of the assemblies (single-particle and icosahedral) whose EM maps have been 
deposited in the EMDB archive, as generated with EMstats (pdbe.org/emstats). The distribution chart shows that only 22% of the studies are of 
assemblies with MW<0.5MDa. This is a consequence of the technical difficulties encountered when using the single-particle technique to study 
low-molecular-weight molecules or complexes. However, the trend charts show that the number of such studies is gradually increasing, which 
suggests that the field is learning how to overcome the experimental difficulties. 



view, subject to data availability. Some of these views are 
also accessible directly from the PDBe Atlas pages of 
NMR entries as well as from OLDERADO and 
VASCO report pages. 

OTHER NEW OR IMPROVED SERVICES 

Chemistry and taxonomy-based structure browsers 

PDBeXplore (pdbe.org/explore) is a browsing interface 
for retrieving and analysing information on subsets 
of structures in the PDB using various biological and 
chemical classifications (9). Previously released 
PDBeXplore modules enable browsing of the contents of 
the PDB based on Enzyme Class (26) (pdbe.org/ec), 
CATH domains (14) (pdbe.org/cath), Pfam families (15) 
(pdbe.org/pfam) or FASTA-based (27) sequence- 
similarity searches (pdbe.org/fasta). These browser 
modules retrieve results much faster than before and 
they have all been updated to include clickable pie 
charts that allow further refinement of the queries. In 
addition, two new browser modules have been released. 
A chemistry-based module (pdbe.org/compounds) enables 
analysis of all PDB entries that contain a chemical 
compound, while a taxonomy-based module (pdbe.org/ 
taxonomy) allows users to retrieve and analyse all 
protein structures in the PDB for any taxonomy level. 
Taxonomy information is taken from the well-established 



NCBI taxonomy database (28,29). The browser module 
also provides easy access to the top 15 species present in 
the largest number of PDB entries. 

Atlas entry pages 

PDBe Atlas pages provide a summary of a PDB entry in a 
user-friendly lay-out and serve as a starting point for 
further exploration of sequence, structure, chemistry and 
function information related to that entry. The summary 
Atlas pages have been improved with several 'action 
buttons' that allow one-click access to commonly used 
functionality [e.g. downloading the PDB file, viewing the 
structure in 3D, launching PDBeFold (30) to find similar 
structures in the PDB, or accessing the PISA (11) results]. 
The summary Atlas pages now also contain a table that 
lists all UniProt entries contained in the entry and action 
buttons to launch either a sequence search of the entire 
PDB or the UniPDB widget (see above) for each of these 
UniProt entries. The ligand Atlas pages now provide links 
to the ChEMBL (bioactivity data; https://www.ebi.ac.uk/ 
chembldb/) (31) and ChEBI (chemical annotation; www. 
ebi.ac.uk/chebi/) (32) resources at the EBI. Finally, the 
experiment-related Atlas pages for NMR entries now 
contain links to any NMR-related resources at PDBe 
and BMRB as well as to the Vivaldi viewer for interactive 
analysis of the structure in the light of experimental and 
validation-related data (see above). 
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Figure 5. Use of Vivaldi to display and analyse PDB data about NMR entries [in this example, entry 2KNR which has been discussed by Lemak 
et at. (33): pdbe.org/vivaldi/2knr]. (A) Results of OLDERADO (22) and VASCO (23) analysis of the 2KNR ensemble presented in a 3D structure 
display. OLDERADO clusters the members of an ensemble of NMR models. The representative model of the largest cluster is shown as a cartoon. 
The colouring of the models signifies the various structural domains indentified by OLDERADO. This analysis indicates that while the central 
P-sheet and the N-terminal oc-helix (all in orange) are rigid with respect to each other, the second ex-helix (blue) forms a separate rigid body or 
'domain'. Atoms with unusual chemical shifts as identified by VASCO are shown as van der Waals spheres. In addition, aromatic residues are shown 
as sticks. This structure exhibits a number of chemical shift outliers that cluster together. However, many of these outliers are near aromatic rings, 
which are known to strongly affect the chemical shifts values of neighbouring atoms due to ring currents. (B) ID graph of the deviation of chemical 
shift values from statistical averages as judged by VASCO as a function of residue number. The height and colour of the bars convey the extent of 
the deviation. This graph shows that most deviations are within usual bounds (green), but some of the outliers (yellow and red) are severe (up to 10 
SD) and require a plausible explanation, such as the presence of a nearby aromatic ring. 



Miscellaneous features 

Quips ('Quite Interesting Pdb Structures'; pdbe.org/quips) 
are short stories about one or more interesting or topical 
structures, coupled with an interactive viewer and, often, a 
tutorial that allows the reader to carry out more detailed 



exploration of a structure using PDBe resources, Figure 6. 
The interactive structure displays comprise a number of 
predefined (often animated) views to highlight concepts 
explained in the text. The tutorials assume that the 
reader has a background in biology, chemistry or 
medicine and an interest in proteins, nucleic acids and 
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Autotaxin (ATX) is a secreted glycoprotein that has 
been implicated in various physiological and 
pathophysiological processes, including vascular and 
neural development, lymphocyte trafficking, fibrosis 
and tumour progression. ATX functions as 
lysophospholipase D (lysoPLD), hydrolysing 
lysophosphatidylcholine (LPC) into the signalling lipid 
lysophosphatidic acid (LPA). LPA stimulates cell 
migration, proliferation and survival in many cell types 
by binding to specific G protein-coupled receptors 
(GPCRs). Given its emerging role in disease, ATX is 
actively pursued as a drug target in both academia 
and industry. 

3D structures of autotaxin in the PDB 

ATX is a -100 kDa protein that belongs to the 
ectonucleotide pyrophosphatase/phosphodiesterase 
(ENPP) enzyme family. ATX is unique in that it is the 
only member with lysoPLD activity (EC 3.1.4.39). It 
removes the choline moiety from the LPC, which can 
vary both in the length and saturation of the acyl 
chain. 

Recently, two teams of researchers have determined 
crystal structures of ATX. The Perrakis/Moolenaar 
groups solved the rat ATX structure in the apo form 
and in complex with an inhibitor (PDB entries 2xr9 
and 2xrg). The Mureki/Aoki groups solved the 
structure of mouse ATX, also in the apo form and in 
complex with LPA products of differing length and 
saturation of the acyl chain (PDB entries 3nkr, 3nkp. 
3nkq, 3nkm. 3nko and 3nkn). 
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Choose View [inhibitor binding in the catalytic site. The 

nhibilor HA155 makes numerous hydrophobic interactions with residues in the 
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Figure 6. Quips (pdbe.org/quips) is an interactive medium to develop stories about biomacromolecular structures. A Quips article contains links 
that activate the OpenAstex Viewer applet to highlight and animate important facets of 3D structures. The text contains marked-up links to 
relevant literature, PDB entries, ligands, glossary terms, tutorials, further exploration options, etc. The example shown here (pdbe.org/ 
quips?story = Autotaxin) accompanies a review on autotaxin structure and function (34). The review along with the interactive Quips provides 
users with better understanding of the topic when compared to static images. 



ligand interactions. New Quips articles are added about 
once a month. 

PDB highlights pages (pdbe.org/highlights) reveal PDB 
entries that are extreme in one sense or another, such as 
their age, their resolution, the number or length of the 
macromolecules contained in them, etc. Extreme entries 
can be listed separately for X-ray, NMR and EM struc- 
tures or for the entire PDB archive. 

Weekly updates of the PDB and EMDB archives can be 
monitored conveniently at pdbe.org/latest. This service 
provides lists of new, modified and removed PDB 
entries, of new and modified chemical compounds in the 
PDB, and of newly released and modified EMDB maps or 
summary files ('headers'). Every entry is shown in a panel 
with core information and an image; the panels can be 
expanded to reveal more information as well as action 
buttons to access commonly used services or files. Result 
lists can be downloaded as easily parsable text files, and 
new RSS feeds provide similar information (see pdbe.org/ 
rss for a list of available feeds). 
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